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vThe  Computerized  File  Organization  Design  System  is  one  of  a set  of  database 
' design  aids  developed  at/the  Management  Information  Systems  Research-Center, 
at  th'  University  of  Minnesota.  :\This  program  is  used  to  automatically 
gene- ate  an  efficient  database  design  for  a single  record  type  in  a multi- 
user environment.  Given  a quantitative  problem  specification,  an  efficient 
combination  of  access  paths  is  determined  for  a collection  of  segmented 
recirds.  This  document  describes  the  use  of  the  Design  System. 
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1.  INTRODUCTION 

The  computerized  database  design  system  contains  two  optimizing  design 
procedures: 

(1)  Record  Segmentation  Optimization,  and 

(2)  File  Organization  Optimization. 

Duhne  [1,  2,  3]  developed  a file  organization  design  procedure  which  is  capable  of 
automatically  selecting  an  efficient  file  organization  for  database  design  problems 
in  which  multiple  database  users  retrieve  subsets  of  entire  data  records  from  a 
single  set  of  stored  data  records.  Data  records,  however,  are  commonly  composed 
of  both  "high  usage"  and  "low  usage"  data  items  (it  is  generally  believed  that 
eighty  percent  of  all  database  retrieval  is  directed  at  only  twenty  percent  of  the 
stored  data  items).  For  operating  efficiency,  therefore,  it  is  desirable  to  par- 
tition the  data  items  stored  within  each  record  into  a primary  record  segment 
retrieval  by  all  users  and  a secondary  record  segment  retrieval  at  extra  cost  only 
when  required.  Eisner  and  Severance  [4]  developed  a procedure  to  determine  a record 
segmentation  which  minimizes  the  total  system  operating  cost. 

In  the  computerized  database  design  system,  Duhne' s file  organization  design 
procedure  is  modified  to  solve  database  design  problems  with  a given  record  segmen- 
tation. This  record  segmentation  is  either  specified  by  an  analyst  or  determined 
by  the  record  segmentation  procedure. 

The  operation  of  the  database  design  system  is  illustrated  in  Figure  1. 

Initially  a problem  is  defined  for  both  the  record  segmentation  procedure  (see  [5]) 
and  the  file  organization  design  procedure  (see  [2]).  Care  must  be  taken  to  insure 
that  the  problem  definition's  are  consistent.*  The  record  segmentation  procedure  is 
then  invoked  and  an  analyst  may  either  define  a record  segmentation  or  use  the 
algorithm  to  determine  an  efficient  record  segmentation  for  the  defined  problem. 

*In  a later  version  of  the  database  design  system  the  problem  definitions  for 
each  procedure  will  be  obtained  from  a single  problem  specification. 
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The  selected  segmentation  is  passed  to  the  file  organization  design  procedure 
(via  a file  called  DBSEG)  which  is  invoked  next.  This  procedure  consists  of  four 
subprocedures  and  six  data  files  as  illustrated  in  Figure  1.  The  selected  data- 
base design  may  be  displayed  at  a terminal  or  output  to  a line  printer.  After  the 
results  have  been  examined,  the  analyst  may  select  a different  record  segmentation 
or  modify  the  problem  definitions  and  reinvoke  the  design  procedure.  The  following 
section  describes  the  user  interaction  with  the  design  procedure. 

2 . AN  EXAMPLE  OF  USER  DIALOG. 

This  section  presents  an  annotated  user  dialog  to  illustrate  the  use  of  the 
database  design  system.  In  the  example  a user  first  retrieves  a previously  stored 
record  segmentation  problem  definition  (see  [5])  and  evaluates  a user  defined  seg- 
mentation. The  record  segmentation  algorithm  is  then  invoked  to  determine  an 
optimal  record  segmentation  for  the  design  problem.  This  segmentation  is  saved 
for  the  host  file  optimization  procedure  and  the  record  segmentation  procedure  is 
ended. 

The  user  then  invokes  the  file  organization  design  procedure,  again  with  a 
previously  stored  problem  definition  (corresponding  to  the  problem  analyzed  by  the 
record  segmentation  algorithm).  An  optimal  file  organization  design  is  determined 
given  the  record  segmentation  determined  by  the  record  segmentation  procedure. 

In  the  annotated  user  dialog  presented  below  capital  letters  designate  user- 
computer  dialog.  Lines  typed  by  the  user  begin  with  either  a question  mark  or  a 
slash  (supplied  as  a prompting  character  by  the  system).*  The  user  terminates  an 
input  line  with  a carriage  return  (or  ETX,  depending  on  the  computer  system). 

The  present  dialog  corresponds  to  a CDC  CYBER  74,  KRONOS  operating  system. 
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In  order  to  use  the  design  system,  a user  initially  logs  on  through  an 
operating  system  and  enters  procedure  call  mode  (BATCH  mode  in  the  KRONOS  operating 
system);  the  user  types  BATCH,  the  system  responds 

$RFL, 20000.  (The  system  acknowledges  the  command.) 

/-DBSYST  (The  procedure  -DBSYST  readies  the  computer  system 

to  accept  other  database  system  commands.) 

/-DBDSGN  (The  user  calls  the  database  design  procedure.) 


RECORD  SEGMENTATION  ALGORITHM 

BEGIN  ALL  RESPONSES  IN  COLUMN  1 (The  record  segmentation 

REQUEST  BRIEF  MODE  (YES  / NO)  procedure  is  entered) 

? YES 

REQUEST  PREVIOUSLY  STORED  PROBLEM  (YES  / NO) 

? YES 


REQUEST  DISPLAY  OF  INPUT  (YES  / NO) 
? NO 


REQUEST  CORRECTIONS  TO  INPUT  (YES  / NO) 
? NO 

REQUEST  PROBLEM  STORED  (YES  / NO) 

? NO 

REQUEST  BLOCKED  PRIMARY  FILE  (YES  / NO) 
? YES 

ENTER  BUFFER  SIZE  (CHARACTERS) 

? 3170 


(NOTE  this  buffer  size  must  be 
the  same  as  the  length  of  a memory 
block  for  the  data  area  in  the 
file  organization  problem  description.) 
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ENTER  SEGMENTATION  OPTION  OPTIMAL  / PARAMETRIC  / BOTH  / DEFINE  / END 
? DEFINE  (A  segmentation  is  defined  by  the  user.) 

ENTER  RECORD  SEGMENTATION  VECTOR  (1  = PRIMARY,  2 = SECONDARY) 
711112222121212121  1122221211211 


FOR  C .GE.  0 (The  user  defined  segmentation 

is  automatically  evaluated  and 

DATA  ELEMENTS  IN  PRIMARY  displayed.) 

1-  4,  9,11,13,15,17-19,24,26-27,29-30 

SUBSET  OF  DATA  ITEMS  DELETED 

5-  8,10,12,14,16,20-23,25,28 

USERS  WITH  POSITIVE  PROBABILITY  OF  BEING  DISSATISFIED 
1-18,20 

REQUEST  DISPLAY  OF  PROBABILITIES  FOR  ABOVE 
DISSATISFIED  USERS  (YES  / NO) 

? NO 

COST  = 1532.96 

ENTER  SEGMENTATION  OPTION  OPTIMAL  / PARAMETRIC  / BOTH  / DEFINE  / END 
? OPTIMAL  (An  optimal  segmentation 

is  determined.) 


OPTIMAL  SOLUTION  FOUND 


FOR  C .GE.  .0143 

DATA  ELEMENTS  IN  PRIMARY 

1-  4,  9,11,13,15,24,26-27 


(The  optimal  segmentation 
is  evaluated  and  displayed.) 


SUBSET  OF  DATA  ITEMS  DELETED 
17-19,29-30 


USERS  WITH  POSITIVE  PROBABILITY  OF  BEING  DISSATISFIED 
1-  2,  4,12,17-18,20 

REQUEST  DISPLAY  OF  PROBABILITIES  FOR  ABOVE 
DISSATISTIED  USERS  (YES  / NO) 

? NO 

COST  = 336.38 
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REQUEST  SENSITIVITY  ANALYSIS  (YES  / NO) 

? NO 

SEGMENTATION  OPTION  FOR  HOST  FILE  OPTIMIZATION  CURRENT  / DEFINE 
? CURRENT  (The  current  segmentation,  i.e., 

the  optimal,  is  saved  for  the  file 
organization  evaluation.) 


REQUEST  RECORD  SEGMENTATION  ALGORITHM  FOR  A NEW 
PROBLEM  ( YES  / NO) 

? NO 

RECORD  SEGMENTATION  ALGORITHM  ENDED 
STOP 
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(The  file  organization  design 
procedure  is  automatically 
entered. ) 
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REQUEST  AUTOMATIC  GENERATIONS  OF  PARAMETERS  FOR  ALL  HOST  FILES  (YES/NO) 
?YES  (The  user  wants  automatic  optimization) 


ENTER  RECORD  SEGMENTATION  OPTION 
UNSEGMENTED  / FROM  SEGMENTATION  MODULE 

?FROM  SEGMENTATION  MODULE  (The  record  segmentation  previously 

saved  is  used  in  the  design  optimi- 
zation. Optionally  the  user  could 
have  selected  UNSEGMENTED  records 
for  the  design. ) 


(The  system  then  outputs  some  information  as  the  optimization 
procedure  progresses.  First  the  host  file  names,  then  minimum  cost 
in  each  interaction,  finally,  when  the  solution  is  reached  it  is 
stored  in  file  DBSOLN,  and  a display  program  is  called  to  permit  the 
user  to  analyze  the  solution.  The  system  outputs  page  zero:) 

ENTER  PAGE  NUMBER  YOU  WANT  / (CARRIAGE-RETURN)  FOR  NEXT  PAGE  / (Q)  TO  QUIT 

PAGE  0. NEXT : 

? (Carriage-Return)  (The  user  wants  to  see  the  first  page)  (The  first 
part  of  the  solution  summary  is  displayed.  The  output  is  divided 
into  "pages"  which  are  not  larger  than  the  size  of  a CRT  display. 

The  bottom  two  lines  read:) 

PAGE  L.  NEXT:  (The  system  indicates  that  the  displayed  page  is  number  one 
and  asks  for  the  next  page  to  be  displayed) 

? (Carriage-return)  (After  analyzing  the  displayed  page,  the  user  wants  the 
next  page) 

(Page  two  is  displayed.  The  user  continues  analyzing  the  next 
pages.  The  user  is  allowed  to  jump  among  pages.  Let’s  say,  for 
example,  that  the  current  page  is  page  5,  and  the  user  wants  to 
go  back  to  page  1,  he  types:) 

PAGE  5.  NEXT: 

?1  (The  user  wants  page  1 to  be  displayed  next) 

(Page  1 is  then  displayed  again.  When  the  user  finishes  analyzing 
solution  he  types:) 

PAGE  1.  NEXT: 
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?Q  (Quit:  end  display) 

WANT  TO  QUIT  (YES/NO)  (The  system  asks  again) 

?YES 

ENDISPLAY  (The  system  leaves  the  display  program). 

See  Duhne  [2]  for  a detailed  description  of  the  output  produced  and  additional 
details  of  the  file  organization  design  procedure.  See  [5]  and  Eisner  and  Severance 
[4]  for  details  of  the  record  segmentation  procedure.  System  documentation  and 
complete  listings  of  all  programs  used  in  this  database  design  procedure  are  found 
in  [6]. 
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